-
Notifications
You must be signed in to change notification settings - Fork 3k
Flink: add flink-runtime module #1423
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
I used this patch to copy all the dependencies from Here are the dependencies: ➜ dependencies git:(flink-runtime) ✗ ls -atlr
total 72120
drwxr-xr-x 6 openinx staff 192 Sep 4 17:35 ..
-rw-r--r-- 1 openinx staff 70648 Sep 4 17:35 iceberg-data-746c0c8.dirty.jar
-rw-r--r-- 1 openinx staff 80259 Sep 4 17:35 iceberg-orc-746c0c8.dirty.jar
-rw-r--r-- 1 openinx staff 269140 Sep 4 17:35 iceberg-parquet-746c0c8.dirty.jar
-rw-r--r-- 1 openinx staff 31033 Sep 4 17:35 iceberg-hive-metastore-746c0c8.dirty.jar
-rw-r--r-- 1 openinx staff 723066 Sep 4 17:35 iceberg-core-746c0c8.dirty.jar
-rw-r--r-- 1 openinx staff 341213 Sep 4 17:35 iceberg-api-746c0c8.dirty.jar
-rw-r--r-- 1 openinx staff 31339 Sep 4 17:35 iceberg-common-746c0c8.dirty.jar
-rw-r--r-- 1 openinx staff 587956 Sep 4 17:35 avro-1.9.2.jar
-rw-r--r-- 1 openinx staff 1781264 Sep 4 17:35 orc-core-1.6.3-nohive.jar
-rw-r--r-- 1 openinx staff 28953 Sep 4 17:35 orc-shims-1.6.3.jar
-rw-r--r-- 1 openinx staff 19512529 Sep 4 17:35 parquet-avro-1.11.0.jar
-rw-r--r-- 1 openinx staff 437637 Sep 4 17:35 parquet-hadoop-1.11.0.jar
-rw-r--r-- 1 openinx staff 1619439 Sep 4 17:35 parquet-column-1.11.0.jar
-rw-r--r-- 1 openinx staff 848718 Sep 4 17:35 parquet-encoding-1.11.0.jar
-rw-r--r-- 1 openinx staff 96197 Sep 4 17:35 parquet-common-1.11.0.jar
-rw-r--r-- 1 openinx staff 700667 Sep 4 17:35 parquet-format-structures-1.11.0.jar
-rw-r--r-- 1 openinx staff 41203 Sep 4 17:35 slf4j-api-1.7.25.jar
-rw-r--r-- 1 openinx staff 15322 Sep 4 17:35 findbugs-annotations-1.3.9-1.jar
-rw-r--r-- 1 openinx staff 1634977 Sep 4 17:35 iceberg-bundled-guava-746c0c8.dirty.jar
-rw-r--r-- 1 openinx staff 1403755 Sep 4 17:35 jackson-databind-2.10.2.jar
-rw-r--r-- 1 openinx staff 349074 Sep 4 17:35 jackson-core-2.10.2.jar
-rw-r--r-- 1 openinx staff 846706 Sep 4 17:35 caffeine-2.7.0.jar
-rw-r--r-- 1 openinx staff 615064 Sep 4 17:35 commons-compress-1.19.jar
-rw-r--r-- 1 openinx staff 193908 Sep 4 17:35 checker-qual-2.6.0.jar
-rw-r--r-- 1 openinx staff 13161 Sep 4 17:35 error_prone_annotations-2.3.3.jar
-rw-r--r-- 1 openinx staff 181098 Sep 4 17:35 aircompressor-0.15.jar
-rw-r--r-- 1 openinx staff 102244 Sep 4 17:35 jaxb-api-2.2.11.jar
-rw-r--r-- 1 openinx staff 19014 Sep 4 17:35 annotations-17.0.0.jar
-rw-r--r-- 1 openinx staff 233745 Sep 4 17:35 threeten-extra-1.5.0.jar
-rw-r--r-- 1 openinx staff 68080 Sep 4 17:35 jackson-annotations-2.10.2.jar
-rw-r--r-- 1 openinx staff 1786250 Sep 4 17:35 parquet-jackson-1.11.0.jar
-rw-r--r-- 1 openinx staff 2021167 Sep 4 17:35 snappy-java-1.1.7.3.jar
-rw-r--r-- 1 openinx staff 111119 Sep 4 17:35 commons-pool-1.6.jar
-rw-r--r-- 1 openinx staff 26586 Sep 4 17:35 javax.annotation-api-1.3.2.jar
drwxr-xr-x 37 openinx staff 1184 Sep 4 17:35 .
-rw-r--r-- 1 openinx staff 20924 Sep 4 17:35 audience-annotations-0.11.0.jar |
|
Do we expect the Flink cluster has hive execution environment like Spark cluster? In our Flink cluster, we don't have a hive environment so that we require users to upload the application jar along with a hive-exec jar. The spark cluster doesn't need this since it has built-in hive jars. Thoughts? |
|
I'm not quite sure whether we need to pack the hive-exec jar within the flink-runtime jar now. Seems it's a fat jar and include an old version guava dependency ( just notice the comment in build.gradle), I will try the demo under hive environment. |
|
I read the apache flink hive document here, It's recommended to use the bundled So I think we don't need to include the |
|
Thanks @openinx! I wasn't aware Flink has this bundle jar. |
|
@kbendick, should this be labelled |
Hmm yes it definitely should be. I have |
|
Thanks, @openinx! Looks good now. |
This patch will create a separate flink runtime module named
flink-runtime, it will shade the common dependency jars and archive all flink connector related classes into a jar. Now I have the basic verification under my localhost as the following:wget https://www.apache.org/dyn/closer.lua/flink/flink-1.11.1/flink-1.11.1-bin-scala_2.12.tgz tar xzvf flink-1.11.1-bin-scala_2.12.tgz cd flink-1.11.1We will see the runtime jar located like:
It will need still more work to fill the LICENSE and NOTICE in
flink-runtimemodule, and I will test more cases to confirm whether it works well.